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Abstract 


Recently we have presented the analytical relationship between choice probabilities, noise corre¬ 


lations and read-out weights in the classical feedforward decision-making framework (Haefner et al. 


(2013)). The derivation assumed that behavioral reports are distributed evenly between the two 


possible choices. This assumption is often violated in empirical data especially when computing so- 
called grand CPs combining data across stimulus conditions. Here, we extend our analytical results 
to situations when subjects show clear biases towards one choice over the other, e.g. in non-zero 
signal conditions. Importantly, this also extends our results from discrimination tasks to detection 
tasks and detect probabilities for which much empirical data is available. We find that CPs and DPs 
depend monotonously on the fraction, p, of choices assigned to the more likely option: CPs and DPs 
are smallest for p = 0.5 and increase as p increases, i.e. as the data deviates from the ideal, zero- 
signal, unbiased scenario. While this deviation is small, our results suggest a) an empirical test for 
the feedforward framework and b) a way in which to correct choice probability and detect probability 
measurements before combining different stimulus conditions to increase signal/noise. 


1 Introduction 


Understanding the contribution of sensory neurons to perceptual decisions has been a long-standing goal 


of systems neuroscience (Parker and Newsome (1998)). One popular approach to answer this question 
has been to compute the correlation between individual neuronal variability of those sensory neurons and 
the choice of the subject in perceptual decision-making tasks. In discrimination tasks, these correlations 
are quantified as ’choice probability’ (CP), and in detection tasks, as ’detect probability’ (DP) - reviewed 
in Nienborg et al. (2012). Unfortunately, the interpretation of CPs and DPs is complicated by the fact 


that they not only depend on how individual neurons are being read-out, but also on how these neurons 
are correlated with each other (Shadlen et al. (1996])). Recently, we presented the analytical relationship 
between CPs, read-out weights and noise correlations for the case of a two-alternatives discrimination 
task assuming that the subject’s decisions were unbiased. Here, we extend this work to include biased 
decision-making when trials are not evenly split between the two choice, and to detection tasks. 
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2 Results 


We assume a linear read-out of the responses r = (ri,r 2 ,... r n ) of n sensory neurons by a hypothetical 
decision neuron r D = w T r = Yji=i Wiri - We further assume that a decision is made in favor of either 
choice 1 (of 2 choices, in a discrimination task), or in favor of ’detect’ or ’ignore’ (in a detection task) 
when the activity of the decision neuron r D exceeds a given threshold 9: choice 1 if r D > 9 and choice 2 if 
r D < 9. In this mathematical framework discrimination tasks and detection tasks are equivalent and, for 
simplicity, in the following our notation will use ’choice 1’ and ’choice 2’ only. Equally, we will use CP to 
denote both CPs and DPs. We further assume that the response of each neuron is normally distributed 
around a mean given by its tuning curve, fi(s) for a particular stimulus value s: r, = /,(s) + r)i where 
7/ ~ 7V(0, C) with covariance matrix C. 

This allows us to compute the choice-triggered response distribution for each neuron, p(ri\r D > 9) and 
p(rj|r D < 9), and the choice-triggered average (CTA), i.e. the difference in the mean responses preceding 
choice 1 and choice 2, respectively (Methods): 


Achoicer := <p(ri|r D > 9)) - (p(n |r D < 9)) 

1 / 1 Cw 




with p = $ 


P (1 — P) V 2tt Vw t Cw 
w T f(s) — 6 


V w T Cw 


: 0:1 


where p is the probability with which the subject chooses choice 1 over choice 2. 
CPs can be computed as 


rOD 

CPi = d xp{n = x\r° < 9) d yp(n = y\r° > 9) 

J — 00 Jx 


For p = 0.5 this integral is identical to that for CPs in the unbiased case. Hypothesizing an equivalent 


relationship between CTA and CP for the general case (Haefner et al. (2013)), suggests this approximate 
formula: 


CP‘*| + — /-T- 

2 ?r y CmW t Cw 


(Cw), e xp{-|[$ Hp)] 2 } 

4p(l-p) 


(1) 


In simulations we have confirmed that the error induced by the approximation is below 0.5% for p < 0.9, 
which is more than a magnitude smaller than current empirical error bars. 

Interestingly, this approximation becomes better as p deviates from 0.5 and is therefore better than 


the first-order approximation presented for unbiased CPs in Haefner et al. (2013). This is presumably 
due to the fact that the choice-triggered distributions, p(r\r° < 9) and p(r|r D > 9), are maximally non- 
Gaussian for the balanced case of p = 0.5 and become progressively more Gaussian as p deviates from 
0.5, and that for the Gaussian case the CTA-CP relationship is linear. 

Figure [l] plots the bias-dependent term that captures the ratio of the CP in a biased condition to the 
CP in an unbiased scenario. The bias is quantified by the probability of choice 1, p, ranging from 0 to 1, 
with 0.5 being unbiased. 

While the read-out strategy for detection tasks is currently unknown, equation ([!]) implies that for 
2-pool models, i.e. read-outs that rely on the difference between the responses of two neuronal pools, the 
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Figure 1: Ratio between actual CPs and those for the balanced case of an even number of choices. 


insights gained from discrimination tasks also hold (Haefner et al. (2013)). In particular, in a model in 
which the averaged responses of two large homogenous neural pools (all neurons have the same response 
variance) are subtracted from each other, DPs only depend on the difference between the average noise 
correlation coefficient between neurons in the same pool and the average correlation coefficient between 
neurons in different pools. This means that DPs in this case depend not on the overall mean of the noise 
correlations but on their ’structure’, i.e. the difference between within and across pool correlations. In 
contrast, in the equivalent 1-pool model, where the average neuronal activity of a neuronal ’pref’ pool is 
compared to an internal threshold 9, the DP of neurons in the pref pool depends on the average noise 
correlation within that pool while the DP of other neurons depends on their average correlation with the 
pref pool neurons. We obtain for large populations (where the diagonal terms can be neglected): 


CP 


CP pre f 


CPnuIl 


1 + V2 1 ( C pref C diff \ /1\ exp{-|[$ Mp)] 2 } 

2 7T -y/CnW T Cw \C diff C nul iJ yoj 4p(l~p) 

1 V2 _exp{ —i^- 1 ^)] 2 } 

2 + n V^ 4p(l-p) and 

1 | V2 c diff exp{-i [$-i(p)] 2 } 

2 7T -^/Cpref 4p(l-p) 


and therefore 


where c pre f is the average correlation between neurons within the pref pool and c d ;ff is the average 
correlation between neurons that are in different pools. 


3 Discussion 

We have extended the known analytical relationship between choice probabilities, read-out weights and 
noise correlations in the linear feedforward framework to situations where the subject shows a bias 
towards one of the two choices. This, more general case, encompasses discrimination tasks where CPs are 
computed for non-zero-signal trials, and detection tasks and detect probabilities. While this generalization 
is only approximate, the error due to this approximation is much smaller than current measurement 
errors. Furthermore, like the approximation previously presented for the balanced case, it allows for an 
easy intuition into how observable CPs depend on read-out weights, and noise correlations. Notably, it 
differs from the balanced CP by a factor that only depends on the probability with which the subject 
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chooses choice 1 (or detect) over choice 2 (or miss) - a value that is easily inferred from the data. 

Prior work on the effect of choice bias on CPs (Kang and Maunsell (2012)) has found that CPs are 
underestimated when grand CPs are computed by z-scoring and combining responses across different 
signal conditions. This finding is different and not in contradiction with the result presented here, that 
actual CPs should increase with choice bias in the feedforward framework. 

The dependency of CP on the choice bias that we report has three immediate consequences. One, 
it suggests an empirical test for the feedforward, linear decoding, decision-making framework that is 
the basis for the presented relationship. Two, it suggests a way in which to compare CPs measured 
at different levels of choice bias, for instance in order to combine signal trials and zero-signal trials to 
increase signal-to-noise in the CP measurements (so-called ’grand CPs’, Nienborg et al. (2012)). Third, 
and most importantly, it extends the benefits of an analytical relationship from discrimination tasks to 
detection tasks for which much physiological data is available (reviewed in Nienborg et al. (2012)). 
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4 Methods 

The choice-triggered response distribution for each neuron is given by: 

„ (r | r D > m = ^ D> W~) 

n ’ p(rD > 9) 

= p~ 1 p(r D > 6\n)p(ri) 

p ( Hr p < m = P( rU < OlnWi) 

P[rilr U) p(rD < 9) 

= (1 -p) _1 p(r D < 9\ri)p(n) 

where p = 1 — $[0 : w T f(s);w T Cw] = <f>[w T f(s) — 9 : 0;w T Cw] is the fraction of trials during which 

the subject reported choice 1. p = 0.5 for the unbiased case. We defined (/)(n : fi(s);Cu ) to be the 
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Gaussian probability density over r, with mean /,(s) and variance Cu. Equivalently, $ is defined to be 
the cumulative Gaussian. It follows: 


p(r, ; |r D > 9) = p 1 (j)(r i : /»(s); Cu) 

1 - $ 
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From that the choice-triggered averages can be computed: 

rOD 

j drip(ri\r D > 9) 
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We can solve this integral using ip'(x ) = —x<j)(x) and the following auxiliary relationship: 
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Defining 


a = 
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Equivalently, we find: 
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Hence 


Achoicer := <p(r.i|r D > 6*)) - <p(r, ; |r 
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